Modular Graph Transformer Networks for Multi-Label Image Classification

Abstract

With the recent advances in graph neural networks, there is a rising number of studies on graph-based multi-label classification with the consideration of object dependencies within visual data. Nevertheless, graph representations can become indistinguishable due to the complex nature of label relationships. We propose a multi-label image classification framework based on graph transformer networks to fully exploit inter-label interactions. The paper presents a modular learning scheme to enhance the classification performance by segregating the computational graph into multiple sub-graphs based on modularity. The proposed approach, named Modular Graph Transformer Networks (MGTN), is capable of employing multiple backbones for better information propagation over different sub-graphs guided by graph transformers and convolutions. We validate our framework on the MS-COCO and Fashion550K datasets to demonstrate improvements for multi-label image classification. The source code is available at https://github.com/ReML-AI/MGTN.
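As a rough illustration of what splitting a label graph "based on modularity" can mean, the sketch below scores two candidate partitions of a toy label co-occurrence graph with Newman's standard modularity. The abstract does not say which modularity measure or partitioning algorithm MGTN uses, so the choice of Newman modularity, the function name `modularity`, and the label names and edges are all assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def modularity(edges, community_of):
    """Newman modularity Q of a partition of an undirected, weighted graph.

    edges: iterable of (u, v, weight) tuples with u != v
    community_of: dict mapping each node to a community id
    """
    total_weight = 0.0               # m: sum of all edge weights
    internal = defaultdict(float)    # edge weight that stays inside each community
    degree_sum = defaultdict(float)  # summed node degrees per community
    for u, v, w in edges:
        total_weight += w
        degree_sum[community_of[u]] += w
        degree_sum[community_of[v]] += w
        if community_of[u] == community_of[v]:
            internal[community_of[u]] += w
    if total_weight == 0:
        return 0.0
    # Q = sum over communities of (L_c / m) - (d_c / 2m)^2
    return sum(
        internal[c] / total_weight - (degree_sum[c] / (2 * total_weight)) ** 2
        for c in degree_sum
    )


# Hypothetical label co-occurrence graph (illustrative data, not from the paper):
# two tightly connected label groups joined by a single edge.
EDGES = [
    ("person", "bicycle", 1.0), ("person", "helmet", 1.0), ("bicycle", "helmet", 1.0),
    ("fork", "knife", 1.0), ("fork", "plate", 1.0), ("knife", "plate", 1.0),
    ("person", "plate", 1.0),
]
SPLIT = {"person": 0, "bicycle": 0, "helmet": 0, "fork": 1, "knife": 1, "plate": 1}
MERGED = {label: 0 for label in SPLIT}

q_split = modularity(EDGES, SPLIT)
q_merged = modularity(EDGES, MERGED)
print(q_split > q_merged)  # the two-group split scores higher than one big group
```

A higher score for the two-group split reflects that most co-occurrence weight falls inside the groups rather than between them, which is the kind of structure a modularity-based segregation into sub-graphs would look for.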

Similar resources

Regional Gating Neural Networks for Multi-label Image Classification

This paper proposes a novel deep learning framework for multi-label image classification, namely regional gating neural networks (RGNN). The motivation is twofold. First, global image features (including CNN-based features) ignore the underlying context information among different objects in an image. Consequently, people attempt to use information from objectness regions. However, current ob...

Matrix Completion for Multi-label Image Classification

Recently, image categorization has been an active research topic due to the urgent need to retrieve and browse digital images via semantic keywords. This paper formulates image categorization as a multi-label classification problem using recent advances in matrix completion. Under this setting, classification of testing data is posed as a problem of completing unknown label entries on a data ma...

Proximity-based Graph Embeddings for Multi-label Classification

In many real applications of text mining, information retrieval and natural language processing, large-scale features are frequently used, which often make the employed machine learning algorithms intractable, leading to the well-known problem “curse of dimensionality”. Aiming at not only removing the redundant information from the original features but also improving their discriminating abili...

Multi-label Image Classification with A Probabilistic Label Enhancement Model

In this paper, we present a novel probabilistic label enhancement model to tackle the multi-label image classification problem. Recognizing multiple objects in images is a challenging problem due to label sparsity, appearance variations of the objects, and occlusions. We propose to tackle these difficulties from a novel perspective by constructing auxiliary labels in the output space. Our idea is to...

Graph Convolutional Networks for Classification with a Structured Label Space

It is a usual practice to ignore any structural information underlying classes in multi-class classification. In this paper, we propose a graph convolutional network (GCN) augmented neural network classifier to exploit a known, underlying graph structure of labels. The proposed approach resembles an (approximate) inference procedure in, for instance, a conditional random field (CRF), however wi...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2021

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v35i10.17098